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Abstract. Classical binomial identities are established by giving probabilistic 
interpretations to the summands. The examples include Vandermonde identity 
and some generalizations. 



1. Introduction 



The evaluation of finite sums involving binomial coefficients appears throughout 
the undergraduate curriculum. Students are often exposed to the identity 



Elementary proofs abound: simply choose x = y = f in the binomial expansion 
of (x + y)". The reader is surely aware of many other proofs, including some 
combinatorial in nature. 

At the end of the previous century, the evaluation of these sums was trivialized 
by the work of H. Wilf and D. Zeilberger j7]. In the preface to the charming book 
[7], the authors begin with the phrase 

You've been up all night working on your new theory, you found the 
answer, and it is in the form that involves factorials, binomial 



and then proceed to introduce the method of creative telescoping. This technique 
provides an automatic tool for the verification of this type of identities. 

Even in the presence of a powerful technique, such as the WZ-method, it is often 
a good pedagogical idea to present a simple identity from many different points of 
view. The reader will find in 1 this approach with the example 




The current paper presents probabilistic arguments for the evaluation of certain 
binomial sums. The background required is minimal. The continuous random vari- 
ables X considered here have a probability density function. This is a nonnegative 
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(1.1) 




coefficients, and so on. 
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function fx{x), such that 

(1.3) Pr{X<x)= f fx{y)dy. 

J —oo 

In particular, fx must have total mass 1. Thus, all computations are reduced to the 
evaluation of integrals. For instance, the expectation of a function of the random 
variable X is computed as 

/oo 
9{y)fx{y)dy. 
-co 

In elementary courses, the rcadcir has been exposed to normal random variables, 
written as X ~ A^(0, 1), with density 

(1.5) fx{x) = 



(1.6) f{x;X) = 



and exponential random variables, with probability density function 

Ae"'^^ for a; > 0; 
otherwise. 



oo 



The examples employed in the arguments presented here have a gamma distribu- 
tion with shape parameter k and scale parameter 6, written as X ~ r(fc, 0). These 
are defined by the density function 

(1.7) fix;k,0)=l^'-''-'''/''^^'^' 

I otherwise. 

Here r(s) is the classical gamma function, defined by 

(1.8) Vis) = I 

Jo 

for Res > 0. Observe that if X ~ r{a,e), then X = BY where Y ~ r(a,l). 
Moreover EX" = 6'^{a)„, where 

(1.9) {a)n = + = a(a + 1) . . . (a + n - 1) 

r(a) 

is the Pochhammer symbol. The main property of these random variables employed 
in this paper is the following: assume ~ T{ki,6) are independent, then 

(1.10) Xi + ... + x„ ~r(fci + ... + fc„,^). 

This follows from the fact that that the density probability function for the sum of 

two independent random variables is the convolution of the individual ones. 
Related random variables include those with a beta distribution 



x''-^!- x)'>-^/B{a,b) forO<x<l; 
otherwise. 



(1.11) fa,b{x) = 

Here B{a, h) is the beta function defined by 

dx 



(1.12) B{a,b)= [ x«-l(l-x)*-^ 

Jo 
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and also the symmetric beta distributed random variable Zc, with density propor- 
tional to (1 — x'^Y^^ for — 1 < a; < 1. The first class of random variables can be 
generated as 

(1-13) Ba,6 = TT^, 

t a + t b 

where Ta and Ff, are independent gamma distributed with shape parameters a and 
b, respectively and the second type is distributed as 1 — 2Bc,c, that is, 

2r, - 



(1.14) Zc = l 

where Tc and are independent gamma distributed with shape parameter c. A 
well-known result is that Ba^t and Ta + Th are independent in (jl.l3p : similarly, 
Tc + and Zc are independent in (|1.14p . 

2. A SUM INVOLVING CENTRAL BINOMIAL COEFFICIENTS 

Many finite sums may be evaluated via the generating function of terms appear- 
ing in them. For instance, a sum of the form 

(2.1) S2(n) = ^ aiGj 

is recognized as the coefficient of in the expansion of /(x)^, where 

CXD 

(2.2) /(x) = ^a,a;^" 

is the generating function of the sequence {at}. Similarly, 

(2.3) Sm{n) = ^ afei---afc,„ 

hA:,7^ — n 

is given by the coefficient of in /(x)"^. The classical example 

1 >^ f2f 



(2-4) — ^ — = y , . 

gives the sums 

<") E(!)(l':n-" 

and 

(2-6) E 

^'1 H ^liin—n 

The powers of (1 — 4a;)~^/^ are obtained from the binomial expansion 

(2.7) (i_4a;)-'^=E^(4^)'' 

j=o ■'' 

where {a)j is the Pochhammer symbol. 

The identity (j2.5p is elementary and there are many proofs in the literature. A 
nice combinatorial proof of (|2.6p appeared in 2006 in this journal [3]. In a more 
recent contribution, G. Chang and C. Xu [5] present a probabilistic proof of these 



, i ) \ n — i 



2fci\ /2fc„\ ^ 2^" F(^+n) 
kiJ"'\k^J n\ F(f) ■ 
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identities. Their approach is elenientary: take m independent Gamma random 
variables ^ r(^, 1) and write 

(2.8) ^If^x) = E 



fcl H h^'m— 



where E denotes the expectation operator. For each random variable Xi, the mo- 
ments are given by 

using Euler's duplication formula for the gamma function 

(2.10) r(2z) = ^2'"-ir(z)r(z + i) 

(see [B], 5.5.5) to obtain the second form. The expression 



n 



for the multinomial coefhcients shows that the right-hand side of (|2.8p is 
(2.12) E ''''■^ 

^"iH yk,n—n 



ki I \ k. 



To evaluate the left-hand side of (12. Sp . recall that the sum of m independent T (i, l) 
has a distribution of r(^, 1). Therefore, the left-hand side of (|2.8p is 



(2.13) 



r(f + n) 



r(T) ■ 

This gives p.6p . The special case m = 2 produces p. 5 



3. More sums involving central binomial coefficients 
The next example deals with the identity 



that appears as entry 4.2.5.74 in |4j. The proof presented here employs the famous 
dissection technique, first introduced by Simpson '8] in the simplification of 

(3.2) i(E(Xi+X2)'"-fE(Xi-X2)2"), 

where Xi, X2 are independent random variables distributed as F l). 
The left-hand side is evaluated by expanding the binomials to obtain 

i(E(Xi + X2)'" + E(Xi - ^2)2") = 



2n /„ X 2n 



fe=0 ^ ^ fe=0 ^ 



A PROBABILISTIC APPROACH TO SOME BINOMIAL IDENTITIES 



5 



This gives 

1 " /2n\ 

-(E(Xi + X2f" + EiXi - Xa)^") - V L, EXf EX2"-2fe 

fc=0 

Using (|2.9p . this reduces to 



(.3) i(E,x.+x,-+E,x,-,v,n.<|#t 

fc=0 \ / \ 

The random variable Xi + X2 is r(l, 1) distributed, so 
(3.4) E(Xi+X2)2" = (2n)!, 

and the random variable Xi — X2 is distributed as {Xi + ^2)^1/27 where Z1/2 
is independent of Xi + X2 and has a symmetric beta distribution with density 
fzi/^i^) = I/tt Vl — z^- In particular, the even moments are given by 

Therefore, 

(3.6) E(Xi - ^2)2" = E(Xi + ^2)2" EZ2;2 - 



It follows that 

(3.7) E(Xi+X2)""+E(Xi-X2)^" = (2n)!+ 

The evaluations and dXT]) imply ([XT]) . 

4. An EXTENSION RELATED TO LEGENDRE POLYNOMIALS 

A key point in the evaluation given in the previous section is the elementary 
identity 



(4.1) ! + (-!)'= = 



2 if fc is even; 
otherwise. 



\ p if r = (mod p); 
otherwise; 



This reduces the number of terms in the sum p.3p from 2n to n. A similar can- 
cellation occurs for any p G N. Indeed, the natural extension of (14.11) is given 

by 

p-i 

(4.2) Y.^=^ 

3=0 

Here uj = e^'^^lv is a complex p-th root of unity. Observe that (|4.2p reduces to (14. ip 
when p = 2. 

The goal of this section is to discuss the extension of (13. ip . The main result 
is given in the next theorem. The Legendre polynomials appearing in the next 
theorem are defined by 
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Theorem 4.1. Let n, p be positive integers. Then 



Proof. Replace the random variable Xi — X2 considered in the previous section, 
by Xi + WX2, where is a complex random variable with uniform distribution 
among the p-th roots of unity. That is, 

(4.5) Pr\W = uj'^] = -, ioTO<e<p-l. 
The identity (li^ gives 

(4.6) EW'^ 



1 if r = (mod p); 
otherwise. 



This is the cancellation alluded above. 

Now proceed as in the previous section to obtain the moments 

(4.7) E{Xi+WX2rP = ^ ('"^')ex(""^'^''EX2^p 



fc=0 

inp)l ^ /2kp\ (2{n - k)p 



kp 

n 

- 22np y y y _ 

A second expression for E{Xi + 14^X2)"^ employs an alternative form of the 
Legendre polynomial P„(x) defined in ()4.3p . 



Proposition 4.2. The Legendre polynomial is given by 

1 r J J 1" 

(4.8) F„(x) = — E (x + Vx^ - l)Xi + {x- ^2^2 _ i)x2 , 

n! L J 

where Xi and X2 are independent F (i, l) random variables. 

Proof. The proof is based on characteristic functions. Compute the sum 



(4.9) Ee*(^+^'^^^' ^1 Ee'^"^-^"^^) 



^ j-n p 

V— E (x + -l)Xi + {x- x/x^ - 1) X2 



n 

k=Q 



The moment generating function for a F (-i, l) random variable is 

(4.10) Ee*-^ = (1-0^^^^- 
This reduces (g^ to 

(1 - t{x + - 1)) "^^^ (1 - t{x - - 1)) "^^^ = (1 - 2tx + t^)- 1/2 
which is the generating function of the Legendre polynomials. □ 
This concludes the proof of Theorem 14.11 □ 
Corollary 4.3. Let 2: be a variable and Fi, F2 as before. Then 

(4.11) E(ri + x^ra)" = n!x"F„ (i(x + x'^) . 
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Proof. This resuh foUows from Proposition 14.21 and the change of variables x i— >■ 
■^{x + x~^), known as the Joukowsky transform. □ 

Replacing x by W^^^ in (|4.11|) and averaging over the values of W gives the 
second expression for K{Xi + WX2)'^^ ■ The proof of Theorem 14. II is complete. 

5. Chu-vandermonde 

The arguments presented here to prove (j2.5l) can be generalized by replacing the 
random variables F (i, l) by two random variables r(ai, 1) with shape parameters 
ai and 02, respectively. The resulting identity is the Chu-Vandermonde theorem. 

Theorem 5.1. Let ai and 02 be positive real numbers. Then 

(ai)fc {a2)n-k (fli + a2)n 



(5-1) E 



fc=0 



fc! {n-k)\ 



The reader will find in |2] a more traditional proof. The paper [TIP describes 
how to find and prove this identity in automatic form. 



Exactly the same argument for (j2.6p provides a multivariable generalization of 
the Chu-Vandermonde identity. 

Theorem 5.2. Let {ai}i<i<„j be a collection of m positive real numbers. Then 

/r o^ (ai)fei (am)fc„ 1 . , , s 

fclH \-km—n 

The final stated result presents a generalization of Theorem 14.11 
Theorem 5.3. Let n, p £ N, a e M.+ and uj = e'^'^P. Then 

{kp)l {{n - k)p)\ pf^^ 

Here ci""* (x) is the Gegenbauer polynomial of degree n and parameter a. 
Proof. Start with the moment representation for the Gegenbauer polynomials 

(5.4) Ci'') (x) = ;^Et,,y (c/(:e + - 1) + V{x - ^/x"^ - 1)) " 

with U and independent r(a, 1) random variables. This representation is proved 
in the same way as the proof for the Legendre polynomial, replacing the exponent 
— 1/2 by and exponent —a. Note that the Legendre polynomials are Gegenbauer 
polynomials with parameter a = ^. This result can also be found in Theorem 3 of 

m- □ 



Note 5.4. The value z = 1 in (jOl) gives 

. . {a)kp ia){n-k)p _ «x£„^(a) f ( ^ 

^'■'^ fro(M! ((n-fc)p)!-p^' r'U 

This is a generalization of Chu-Vandermonde. 
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The techniques presented here may be extended to a variety of situations. Two 
examples ihustrate the type of identities that may be proven. They involve the 
Herniite polynomials defined by 



(5.6) H,,{x)^{~ire 



Theorem 5.5. Let to e N. The Hermite polynomials satisfy 

1„ ha;,„\ _ _„/2 v-^ Hk^{xi) Hk^{x,n) 



(5.7) Ai/„ ±i^_^ =™-«/2 y: 

n' \ A /TO / ^ — ^ 



n\ V \/rn I ^-^ k^ ! km ! 

Proof. Start with the moment representation for the Hermite polynomials 

(5.8) Hn{x) ^ 2''E{x + iNy\ 

where N is normal with mean and variance ^. The details are left to the reader. 

□ 

The moment representation for the Gegenbauer polynomials (j5.4[) yields the final 
result presented here. 

Theorem 5.6. Let to e N. The Gegenbauer polynomials cl^\x) satisfy 

(5.9) Ci^^+-+^-Hx) = J2 Cifix)---Ciy\x). 

1^-^^-^ \^kjn—n 

Remark 5.7. A relation between Gegenbauer and Hermite polynomials is given by 

(5.10) hm 4nCi^^ (^) = -H.^{x). 



This relation allows to recover easily identity (|5.7p from identity (15. 9p . 

The examples presented here, show that many of the classical identities for spe- 
cial functions may be established by probabilistic methods. The reader is encour- 
aged to try this method in his/her favorite identity. 
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